Sir:

Prior to calculation of the filing fee and prior to the examination of this application, 
please amend the above-identified application as follows: 
IN THE SPECIFICATION 

Please insert new pages 36-52 containing the revised Sequence Listing and 
renumber the pages containing the claims and abstract accordingly. 
IN THE CLAIMS:

Kindly cancel claim 1 without prejudice or disclaimer. 

Please add the following new claims to the application: 

--13. A protein of the TGF-β family encoded by an isolated DNA molecule which
has a sequence selected from the group consisting of: 

a) the sequence shown in SEQ ID NO:1,

b) a part of SEQ ID NO:1 which encodes the mature protein, 
c) a nucleotide sequence which encodes the amino acid sequence according to
SEQ ID NO:2, 

d) a nucleotide sequence which encodes a portion of the amino acid sequence 
according to SEQ ID NO:2, wherein said portion is the mature protein, and 

e) a sequence which hybridizes to a), b), c) or d) under stringent hybridization 
conditions, wherein said sequence encodes the mature protein of the TGF-β family.

14. The protein according to claim 13, wherein said protein has the amino acid 
sequence shown in SEQ ID NO:2. 

15. A pharmaceutical composition comprising at least one protein according to
claim 13, in combination with a pharmaceutically acceptable carrier, auxiliary substance, 
diluent or filler. 

16. A method for treating or preventing damage to bone, cartilage, connective 
tissues, skin, mucous membranes, epithelium or teeth, comprising administering the 
protein according to claim 13 to a patient in need of such treatment. 

17. A method for improving wound healing and tissue regeneration, comprising 
administering the protein according to claim 13 to a patient in need of such treatment.

18. A dental implant which contains the protein according to claim 13. 
19. An antibody or antibody fragment which binds to the protein according to
claim 13. 

20. An antibody or antibody fragment which binds to a protein comprising the 
amino acid sequence shown in SEQ ID NO:2 or biologically functional parts thereof. 

21. A pharmaceutical composition according to claim 15, wherein the protein is
contained on or in a natural or synthetically prepared matrix material. 

22. A pharmaceutical composition according to claim 21, wherein said matrix
material is a biocompatible porous material that can be biologically degraded. 

23. A protein of the TGF-β family encoded by a DNA molecule which is a part of
SEQ ID NO:1 which encodes the mature protein and further comprises nucleotide 
sequences coding for signal and/or propeptide parts. 

24. A protein of the TGF-β family encoded by a DNA molecule which comprises
nucleotides 640-2142 of SEQ ID NO:1.--

REMARKS 

The above amendments have been made to put the application into better 
condition for examination. 
In this amendment, applicants also amend the specification by inserting a Sequence
Listing as pages 36-52 and renumbering the pages containing the claims and abstract 
accordingly. 

It is respectfully submitted that the foregoing amendments and remarks place the 
application in compliance with 37 CFR 1.821 through 1.825 and no new matter is 
contained in this amendment. 

Enclosed is a computer-readable sequence listing in MS-DOS format on a 3.5" disk. 
The filename is 5649022.APP. 

Should any additional information be required, please contact applicants' 
undersigned representative at the telephone number listed below. 

In the event that any fees are due in connection with this paper, please charge our 
Deposit Account No. 14-1060. 

In the event this paper is not being timely filed, the Applicants respectfully petition
for an appropriate extension of time. Any fees for such an extension together with any 
additional fees may be charged to Counsel's Deposit Account 14-1060. 

Respectfully submitted, 
NIKAIDO, MARMELSTEIN, MURRAY & ORAM LLP 

Monica Chin Kitts 
Attorney for applicants 
Reg. No. 36,105 

Atty. Docket No. P564-9022 
Metropolitan Square 
655 Fifteenth Street, N.W. 
Suite 330 - G Street Lobby 
Washington, D.C. 20005-5701 
Tel (202) 638-5000 
MCK:TPC:ecm 
New growth/differentiation factor
of the TGF-β family



DESCRIPTION 



The present invention concerns a new growth/differentiation
factor of the TGF-β family and DNA sequences
coding therefor.

The TGF-β family of growth factors which includes BMP-,
TGF- and inhibin-related proteins (Roberts and Sporn, 
Handbook of Experimental Pharmacology 95 (1990), 419- 
472) is particularly relevant for a wide range of 
medical treatment methods and applications. These 
factors are suitable in methods which concern wound- 
healing and tissue regeneration. Furthermore several 
members from the TGF-β family induce tissue growth, in
particular growth of bones, and therefore play a crucial 
role in inducing the development of cartilage and bones. 

Wozney (Progress in Growth Factor Research 1 (1989),
267-280) and Vale et al. (Handbook of Experimental
Pharmacology 95 (1990), 211-248) describe various growth
factors such as those which are related to the BMP group 
(bone morphogenetic proteins) and the inhibin group. The 
members of these groups show significant structural 
similarities. The precursor of the protein consists of 
an amino-terminal signal sequence, a propeptide and a 
carboxy-terminal sequence of about 110 amino acids that
is cleaved from the precursor and constitutes the mature
protein. In addition their members are defined by an 
amino acid sequence homology. The mature protein 
contains the most conserved sequences, in particular 
seven cysteine residues which are conserved among the 
family members. The TGF-β-like proteins are
multifunctional, hormonally active growth factors. They 
also have related biological activities such as 
chemotactic attraction of cells, promotion of cell
differentiation and tissue-inducing capabilities such as 
cartilage-inducing and bone-inducing capabilities. The 
US Patent No. 5,013,649 discloses DNA sequences that
code for osteo-inductive proteins that are denoted BMP-2
and the US Patent Applications series No. 179 101 and
170 197 disclose the BMP proteins BMP-1 and BMP-3.
Moreover many types of cells are able to synthesize
TGF-β-like proteins and practically all cells have TGF-β
receptors.

On the whole these proteins show differences in their
structure which lead to significant variations in their
exact biological function. In addition they are found in 
a wide range of different types of tissue and at various 
stages of development. As a result they can exhibit 
differences with regard to their exact function e.g. the 
required cellular physiological environment, their life- 
span, their target sites, their requirements for 
auxiliary factors and their stability against 
degradation. Thus, although a multitude of proteins have 
been described that exhibit a tissue-inductive and in 
particular osteo-inductive potential, their natural 
functions in the organism and - more significantly - 
their medical relevance still have to be investigated in 
detail. It is thought to be highly probable that there
are still unknown members of the TGF-β family which are
important for osteogenesis or the
differentiation/induction of other types of tissue. A
major difficulty in the isolation of these new TGF-β-
like proteins is, however, that their functions cannot
yet be described exactly enough to develop highly 
discriminating bioassays. On the other hand the expected 
nucleotide sequence homology to other members of the 
family is too low to enable a screening by classical 
nucleic acid hybridization techniques. Nevertheless the
further isolation and characterization of new TGF-β-like
proteins is urgently required in order to provide 
further inducing and differentiation-promoting proteins 
that fulfil all the desired medical requirements. These 
factors could be used medically in the healing of 
lesions and the treatment of degenerative diseases of 
bones and/or other types of tissue such as the kidney or
the liver. 

A nucleotide and amino acid sequence for the TGF-β
protein MP-52 is stated in the Patent Application
PCT/EP93/00350 in which the sequence corresponding to
the mature peptide and a major portion of the sequence
corresponding to the propeptide of MP-52 are given. The
complete sequence of the propeptide of MP-52 is not
disclosed.

The object on which the present invention is based is to 
provide DNA sequences that code for new members of the 
TGF-β protein family with mitogenic and/or
differentiation-inductive, e.g. osteo-inductive,
potential. The object of the present invention was
therefore in particular to provide the complete DNA and
amino acid sequence of the TGF-β protein MP-52.

This object is achieved by a DNA molecule that codes for 
a protein of the TGF-β family and which comprises

(a) the part coding for the mature protein and if 
desired further functional parts of the nucleotide 
sequence shown in SEQ ID NO. 1,

(b) a nucleotide sequence corresponding to a sequence 
from (a) within the scope of the degeneracy of the 
genetic code, 



(c) an allelic derivative of a nucleotide sequence 
corresponding to one of the sequences from (a) and (b) 
or 

(d) a sequence hybridizing with one of the sequences
from (a), (b) or (c)

provided that a DNA molecule according to (d) contains
at least the part coding for a mature protein from the
TGF-β family.

Further embodiments of the present invention concern the 
subject matter of claims 2 to 10. Other features and 
advantages of the invention can be derived from the 
description of the preferred embodiments and figures. 
The sequence protocols and figures are now briefly 
described. 

SEQ ID NO. 1 shows the complete nucleotide sequence of
the DNA coding for the TGF-β protein MP-52. The ATG
start codon begins at nucleotide 640. The mature
protein begins after nucleotide 1782.

SEQ ID NO. 2 shows the complete amino acid sequence of
the TGF-β protein MP-52 which was derived from the
nucleotide sequence shown in SEQ ID NO. 1.

Figure 1 shows a comparison between the amino acid 
sequence of MP-52 and several members of the BMP protein 
family starting with the first of the seven conserved 
cysteine residues. * denotes that the amino acid is the 
same in all compared proteins; + denotes that the amino 
acid corresponds in at least one of the proteins 
compared to MP-52.
Figure 2 shows the nucleotide sequences of the
oligonucleotide primers that were used in the present
invention and a comparison of these sequences with known
members of the TGF-β family. M denotes A or C, S denotes
C or G, R denotes A or G and K denotes G or T. 2a shows
the sequence of the primer OD, 2b shows the sequence of
the primer OID.
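
The degenerate-base symbols defined for Figure 2 can be expanded mechanically into the concrete sequences they stand for. The following Python sketch is an illustration only. The function name and the example strings are hypothetical and are not the sequences of the primers OD or OID, which are given only in Fig. 2.

```python
from itertools import product

# Degenerate-base symbols as defined in the description of Figure 2.
DEGENERATE = {
    "M": "AC",
    "S": "CG",
    "R": "AG",
    "K": "GT",
}

def expand_degenerate(primer):
    """Return every concrete sequence encoded by a degenerate primer string."""
    # Unambiguous bases (A, C, G, T) map to themselves.
    choices = [DEGENERATE.get(base, base) for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]

# Hypothetical example string, not a primer from this application.
variants = expand_degenerate("GAMK")
print(variants)
```

Each degenerate position doubles the number of concrete sequences, so a primer with n such positions stands for 2^n oligonucleotides.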

The present invention encompasses at least the part 
coding for the mature protein and if desired further 
functional parts of the nucleotide sequence shown in SEQ 
ID NO. 1 as well as sequences that correspond to this
sequence within the scope of the degeneracy of the 
genetic code and allelic derivatives of such sequences. 
In addition the present invention also encompasses 
sequences that hybridize with such sequences provided 
that such a DNA molecule completely contains at least 
the part coding for the mature protein of the TGF-β
family. 

The term "functional part" within the sense of the 
present invention denotes a protein part which is 
capable of acting for example as a signal peptide, 
propeptide or as a mature protein part i.e. it fulfils 
at least one of the biological functions of the natural 
protein parts of MP-52. 

The region coding for the mature part of the protein
extends from nucleotides 1783 to 2142 of the sequence
shown in SEQ ID NO. 1. If desired, the DNA molecule can
also comprise further functional parts of the sequence
shown in SEQ ID NO. 1, namely the nucleotide sequences
coding for the signal or/and propeptide part. It is
particularly preferred that the DNA molecule comprises
the sequence for the signal and the propeptide part and
the part of the mature protein, i.e. nucleotides 640-2142
of the sequence shown in SEQ ID NO. 1. On the other hand
the DNA molecule can also comprise functional signal
or/and propeptide parts from other proteins in addition
to the part coding for the mature protein, in particular
from other proteins of the TGF-β family, e.g. the above-
mentioned BMP proteins. The respective nucleotide
sequences are given in the references mentioned above, to
the disclosure of which reference is hereby made.
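
The nucleotide coordinates stated above can be checked with simple arithmetic. The Python sketch below is an illustration under stated assumptions: the coordinates are read as 1-based and inclusive, and the helper names are hypothetical. The text does not say whether nucleotide 2142 belongs to a stop codon, so the codon counts are not necessarily amino acid counts.

```python
# Coordinates taken from the description of SEQ ID NO. 1
# (assumed here to be 1-based and inclusive).
ATG_START = 640       # first nucleotide of the ATG start codon
MATURE_START = 1783   # first nucleotide coding for the mature part
CODING_END = 2142     # last nucleotide of the stated coding range

def span_length(first, last):
    """Length in nucleotides of an inclusive 1-based range."""
    return last - first + 1

def codon_count(first, last):
    """Number of codons in the range; fails if it is not a multiple of 3."""
    length = span_length(first, last)
    if length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return length // 3

mature_nt = span_length(MATURE_START, CODING_END)
mature_codons = codon_count(MATURE_START, CODING_END)
full_nt = span_length(ATG_START, CODING_END)
full_codons = codon_count(ATG_START, CODING_END)
signal_pro_codons = codon_count(ATG_START, MATURE_START - 1)

print(mature_nt, mature_codons, full_nt, full_codons, signal_pro_codons)
```

That all three ranges divide evenly into codons is consistent with the statement that the ATG at position 640 and the mature part lie in the same reading frame.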

Moreover the present invention also encompasses a DNA 
molecule as defined above that contains a non-coding 
intron sequence between nucleotides 1270 and 1271 of the
sequence shown in SEQ ID NO. 1. This intron sequence is 
contained in the plasmid SKL 52 (H3) MP12 which is 
deposited at the DSM and has the genomic nucleic acid 
sequence of MP-52. 

The invention also encompasses the cDNA sequence of the 
MP-52 protein coded by the phage λ 15.1. This sequence
starts with nucleotide 321 of SEQ ID NO. 1. 

Although the allelic, degenerate and hybridizing 
sequences which are encompassed by the present invention 
have structural differences due to slight changes in 
their nucleotide or/and amino acid sequence, the
proteins coded by such sequences still essentially have 
the same useful properties that enable their use in 
basically the same medical applications. 

The term "hybridization" according to the present 
invention means the usual hybridization conditions, 
preferably conditions with a salt concentration of 6 x 
SSC at 62 to 66°C followed by a one hour wash with 0.6 x
SSC, 0.1 % SDS at 62 to 66 °C. It is particularly 
preferred that the term "hybridization" denotes 
stringent hybridization conditions with a salt 
concentration of 4 x SSC at 62 to 66 °C followed by a one 
hour wash with 0.1 x SSC, 0.1 % SDS at 62 to 66 °C. 
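
The SSC strengths named above can be converted into salt concentrations. The Python sketch below is an illustration only. It assumes the conventional composition of 1 x SSC (150 mmol/l NaCl and 15 mmol/l trisodium citrate), which the text itself does not define, and the function name is hypothetical.

```python
from decimal import Decimal

# Assumption: conventional 1 x SSC composition in mmol/l.
# The description does not state this composition.
SSC_1X_MMOL_PER_L = {
    "NaCl": Decimal(150),
    "sodium citrate": Decimal(15),
}

def ssc_composition(fold):
    """Return the salt concentrations (mmol/l) of an n-fold SSC solution."""
    factor = Decimal(str(fold))  # Decimal avoids binary rounding of 0.1, 0.6
    return {salt: conc * factor for salt, conc in SSC_1X_MMOL_PER_L.items()}

# Hybridization and wash strengths named in the text.
conditions = {
    "usual hybridization (6 x SSC)": ssc_composition("6"),
    "usual wash (0.6 x SSC)": ssc_composition("0.6"),
    "stringent hybridization (4 x SSC)": ssc_composition("4"),
    "stringent wash (0.1 x SSC)": ssc_composition("0.1"),
}

for name, salts in conditions.items():
    print(name, {salt: str(conc) for salt, conc in salts.items()})
```

Under this assumption the stringent wash contains one sixth of the salt of the usual wash at the same temperature, which is what makes it the more discriminating condition.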

Preferred embodiments of the present invention are DNA 
sequences as defined above that are obtainable from 
vertebrates, preferably mammals such as pigs, cows and 
rodents such as rats or mice and in particular from 
primates such as humans. 

A particularly preferred embodiment of the present 
invention is the sequence denoted MP-52 shown in SEQ ID 
NO. 1. The transcripts of MP-52 were obtained from 
embryonic tissue and code for a protein which has a 
considerable amino acid homology to the mature portion 
of BMP-like proteins (see Fig. 1). The protein sequences
of BMP2 (=BMP2A) and BMP4 (=BMP2B) are described by
Wozney et al., Science 242 (1988), 1528-1534. The
corresponding sequences of BMP5, BMP6 and BMP7 are
described by Celeste et al., Proc. Natl. Acad. Sci. USA
87 (1990), 9843-9847. Several typical sequence 
homologies which are specific for known BMP sequences 
have also been found in the propeptide part of MP-52 
whereas other parts of the precursor part of MP-52 
exhibit considerable differences to BMP precursors. 

In addition the present invention concerns a vector that 
contains at least one copy of a DNA molecule according 
to the invention. The DNA sequence according to the 
invention is preferably operatively linked to an 
expression control sequence in such a vector. Such 
vectors are suitable for the production of TGF-β-like
proteins in stably or transiently transformed cells. 
Various animal, plant, fungal and bacterial systems can 
be used for the transformation and subsequent culture. 
The vectors according to the invention preferably 
contain sequences necessary for replication in the host 
cell and are autonomously replicable. Furthermore the 
use of vectors is preferred that contain selectable 
marker genes which can be used to detect transformation 
of a host cell. 

In addition the invention concerns a host cell which is 
transformed with a DNA according to the invention or 
with a vector according to the invention. Examples of 
suitable host cells include various eukaryotic and 
prokaryotic cells such as E. coli, insect cells, plant 
cells, mammalian cells and fungi such as yeast. 

In addition the invention concerns a protein of the
TGF-β family which is coded by a DNA sequence according to
claim 1. The protein according to the invention 
preferably has the amino acid sequence shown in SEQ ID 
NO. 2 or if desired functional parts thereof and 
exhibits biological properties such as tissue-inductive 
in particular osteo-inductive or/and mitogenic
capabilities that may be relevant for a therapeutic 
application. The above-mentioned characteristics of the 
protein can vary depending on the formation of 
homodimers or heterodimers. Such structures may also
prove to be suitable for clinical applications. 

The biological properties of the proteins according to 
the invention, in particular the mitogenic and osteo-
inductive potential, can be determined for example in
assays according to Seyedin et al., PNAS 82 (1985),
2267-2271 or Sampath and Reddi, PNAS 78 (1981), 7599-
7603.



Furthermore the present invention concerns a process for
the production of a protein of the TGF-β family which is
characterized in that a host cell transformed with a DNA
according to the invention or with a vector according to
the invention is cultured and the TGF-β protein is
isolated from the cell or/and culture supernatant. Such
a process comprises culturing the transformed host cell
in a suitable culture medium and purifying the TGF-β-
like protein produced. In this manner the process
enables the production of an adequate amount of the 
desired protein for use in medical treatment or in 
applications using cell culture techniques which require 
growth factors. The host cell can be a bacterium such as 
Bacillus or E. coli, a fungus such as yeast, a plant 
cell such as tobacco, potato or arabidopsis or an animal 
cell and in particular a vertebrate cell line such as 
MoCOS or CHO cell lines or an insect cell line. 

Yet another subject matter of the present invention is 
the provision of pharmaceutical compositions which 
contain a pharmaceutically active amount of a TGF-β-like
protein according to the invention as the active 
substance. If desired, such a composition comprises a 
pharmaceutically acceptable carrier substance, auxiliary 
substance, diluent or filler. Such a pharmaceutical 
composition can be used in wound-healing and tissue 
regeneration as well as in the healing of damage to 
bones, cartilage, connective tissue, skin, mucous 
membranes, epithelium or teeth and in dental implants 
either alone or in combination with other active 
substances e.g. other proteins of the TGF-β family or
growth factors such as EGF (epidermal growth factor) or
PDGF (platelet derived growth factor). Moreover such a
pharmaceutical composition can be used in the prevention 
of diseases such as the prevention of osteoporosis and 
arthrosis. 

Another possible clinical application of the TGF-β
protein according to the invention is to use it as a 
suppressor of an immunoreaction to prevent rejection of 
organ transplants or its application in connection with 
angiogenesis. The pharmaceutical composition according 
to the invention can also be used prophylactically or in 
cosmetic surgery. In addition administration of the 
composition is not limited to humans but can also 
encompass animals and in particular pets. 

Finally the present invention also concerns an antibody 
that binds specifically to the proteins according to the 
invention or such an antibody fragment (e.g. Fab or 
Fab'). Processes for the production of such a specific 
antibody or antibody fragment are part of the general 
technical knowledge of an average person skilled in the 
art. Such an antibody is preferably a monoclonal 
antibody. Such antibodies or antibody fragments may also 
be suitable for diagnostic methods. 

It is intended to elucidate the invention further by the 
following example. 



Example 1 

Isolation of MP-52 



1.1 Total RNA was isolated from human embryonic tissue 
(8 to 9 weeks old) according to the method of 
Chirgwin et al., Biochemistry 18 (1979), 5294-5299. 
Poly(A+) RNA was separated from the total RNA by 
oligo(dT) chromatography according to the
manufacturer's instructions (Stratagene Poly(A)
Quick columns).

1.2 For the reverse transcription reaction 1 to 2.5 µg
poly(A+) RNA was heated for 5 minutes to 65°C and
quickly cooled on ice. The reaction mixture
contained 27 U RNA-Guard (Pharmacia), 2.5 µg
oligo(dT)12-18 (Pharmacia), 5 x buffer (250 mmol/l
Tris/HCl, pH 8.5, 50 mmol/l MgCl2, 50 mmol/l DTT,
5 mmol/l of each dNTP, 600 mmol/l KCl) and 20 U AMV
reverse transcriptase (Boehringer Mannheim) per µg
poly(A+) RNA. The reaction mixture (25 µl) was
incubated for 2 hours at 42°C.

1.3 The deoxynucleotide primers OD and OID shown in
Fig. 2 were prepared on an automatic DNA
synthesizer (Biosearch). The purification was
carried out by denaturing polyacrylamide gel
electrophoresis and isolating the main bands from
the gel by means of isotachophoresis. The
oligonucleotides were designed by comparing the
nucleic acid sequences of known members of the
TGF-β family and selecting regions with the highest
conservation. A comparison of this region is shown
in Fig. 2. In order to facilitate cloning both
oligonucleotides contained EcoRI restriction sites and
OD additionally contained an NcoI restriction site
at its 5' terminus.

1.4 cDNA corresponding to 20 ng poly(A+) RNA was used
as the starting material in the PCR reaction. The
reaction was carried out in a volume of 50 µl and
contained 1 x PCR buffer (16.6 mmol/l (NH4)2SO4,
67 mmol/l Tris/HCl pH 8.8, 2 mmol/l MgCl2,
6.7 µmol/l EDTA, 10 mmol/l β-mercaptoethanol,
170 µg/ml bovine serum albumin (Gibco)), 200 µmol/l
of each dNTP (Pharmacia), 30 pmol of each
oligonucleotide (OD and OID) and 1.5 U Taq
polymerase (AmpliTaq, Perkin Elmer Cetus). The
reaction mixture was overlaid with paraffin and 40
PCR cycles were carried out. The products of the
PCR reaction were purified by phenol/chloroform
extraction and concentrated by ethanol
precipitation.

1.5 The PCR reaction product was cleaved with the 
restriction enzymes SphI (Pharmacia) and AlwNI 
(Biolabs) according to the manufacturer's 
instructions.

1.6 The products of the restriction cleavage were 
fractionated by agarose gel electrophoresis. After
staining with ethidium bromide, uncleaved 
amplification products were cut out of the gel and 
isolated by phenol extraction. The DNA obtained was 
subsequently purified twice by phenol/chloroform 
extraction. 
1.7 After an ethanol precipitation, a quarter or a
fifth of the isolated DNA was reamplified using the
same conditions as for the primary amplification
except that the number of cycles was reduced to 13.
The reamplification products were purified, cleaved
with the same enzymes as above and the uncleaved
products were isolated from agarose gels as
elucidated above for the amplification products.
The reamplification step was repeated twice.

1.8 After the last isolation from the gel, the 
amplification products were cleaved by 4 units 
EcoRI (Pharmacia) under the conditions recommended 
by the manufacturer. A fourth of the restriction 
mixture was ligated into the vector pBluescriptII
SK+ (Stratagene) cleaved with EcoRI. 24 clones were
analysed further by means of sequencing after
ligation. The sample cleaved with AlwNI and SphI
resulted in a new sequence that was denoted MP-52.
The other clones mainly contained BMP6 sequences
and one contained a BMP7 sequence.
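
The quantities given in steps 1.2 and 1.4 lend themselves to a short worked calculation. The Python sketch below is an illustration only; the function names are hypothetical and the values are those stated in the steps (20 U reverse transcriptase per µg of poly(A+) RNA, 30 pmol of each primer in a 50 µl reaction).

```python
from fractions import Fraction

def rt_units_needed(rna_ug, units_per_ug=20):
    """Units of AMV reverse transcriptase for a given RNA mass (step 1.2)."""
    return Fraction(str(rna_ug)) * units_per_ug

def molar_conc_umol_per_l(amount_pmol, volume_ul):
    """Concentration in umol/l; 1 pmol per ul is the same as 1 umol/l."""
    return Fraction(amount_pmol, volume_ul)

rt_units_low = rt_units_needed(1)             # 1 ug poly(A+) RNA
rt_units_high = rt_units_needed("2.5")        # 2.5 ug poly(A+) RNA
primer_conc = molar_conc_umol_per_l(30, 50)   # 30 pmol primer in 50 ul

print(float(rt_units_low), float(rt_units_high), float(primer_conc))
```

The enzyme amount therefore ranges from 20 to 50 U across the stated RNA input, and each primer is present at 0.6 µmol/l in the PCR.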

The clone was completed up to the 3' end of the cDNA
according to the method described in detail by Frohmann
(Amplifications, published by Perkin-Elmer Corp., Issue
5 (1990), pp 11-15). The same embryonic mRNA that had
been used to isolate the first fragment of MP-52 was
reverse transcribed as described above. The
amplification was carried out using the adapter primer
(AGAATTCGCATGCCATGGTCGACG) and an internal primer
(CTTGAGTACGAGGCTTTCCACTG) of the MP-52 sequence. The
amplification products were reamplified using an
overlapping adapter primer (ATTCGCATGCCATGGTCGACGAAG)
and an overlapping internal primer
(GGAGCCCACGAATCATGCAGTCA) of the MP-52 sequence. After
restriction cleavage with NcoI the reamplification
products were cloned into a vector that
was cleaved in the same way (pUC 19 (Pharmacia No. 27-
4951-01) having a modified multiple cloning site which
contains a single NcoI restriction site) and sequenced.
The clones were characterized by their sequence
overlapping at the 3' end of the known MP-52 sequence.
One of these was used as a probe to screen a human
genomic gene bank (Stratagene No. 946203) according to a
method described in detail by Ausubel et al. (Current
Protocols in Molecular Biology, published by Greene
Publishing Associates and Wiley-Interscience (1989)).
From 8 x 10^5 λ phages, one phage (λ 2.7.4) was isolated
which contained an insertion of about 20 kb and which is
deposited at the DSM under the depository number 7387.
This clone contains further sequence information at the
5' end in addition to the sequence isolated from mRNA by
the described amplification methods.

For the sequence analysis a HindIII fragment of about
7.5 kb was subcloned into a vector cleaved in the same
manner (Bluescript SK, Stratagene No. 212206). This
plasmid denoted SKL 52 (H3) MP12 was also deposited at
the DSM under the depository number 7353. The sequence
information shown in SEQ ID NO. 1 was derived from the
phage λ 2.7.4. The ATG at position 640 is the first ATG
within the reading frame (a stop codon occurs at
position 403). Based on the sequence data it may be
assumed that this is the start codon for the
translation.

The genomic DNA contains an intron of about 2 kb between
base pairs 1270 and 1271 of SEQ ID NO: 1. The sequence
of the intron is not shown. The correctness of the
splicing site was confirmed by sequencing an
amplification product which was derived from cDNA
containing this region. This sequence information was
obtained using a slightly modified method which is
described in detail by Frohmann (Amplifications,
published by Perkin-Elmer Corporation, Issue 5 (1990),
pp 11-15). The same embryonic RNA that was also used to
isolate the 3' end of MP-52 was reverse transcribed
using an internal primer orientated in the 5' direction
of the MP-52 sequence (ACAGCAGGTGGGTGGTGTGGACT). A
polyA tail was attached to the 5' end of the first
cDNA strand using terminal transferase. A two-step
amplification was carried out, firstly by using a
primer composed of oligo dT and an adapter sequence
(AGAATTCGCATGCCATGGTCGACGAAGC(T16)) and secondly an
adapter primer (AGAATTCGCATGCCATGGTCGACG) and an
internal primer (CCAGCAGCCCATCCTTCTCC) from the MP-52
sequence. The amplification products were reamplified
using the same adapter primer and an overlapping
internal primer (TCCAGGGCACTAATGTCAAACACG) from the
MP-52 sequence. Subsequently the reamplification products
were reamplified using an overlapping adapter primer
(ATTCGCATGCCATGGTCGACGAAG) and an overlapping internal
primer (ACTAATGTCAAACACGTACCTCTG) from the MP-52
sequence. The final reamplification products were cloned
with blunt ends into a vector (Bluescript SK, Stratagene
No. 212206) which had been cleaved with EcoRV. The
clones were characterized by their sequence overlapping
with the DNA of λ 2.7.4.
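
The adapter and internal primers quoted above can be inspected programmatically, both for restriction sites and for the overlap between successive (nested) primers. The Python sketch below is an illustration only. The primer sequences are copied from the text, whereas the variable and function names are hypothetical and the recognition sequences are the standard ones for these enzymes, not values stated in this description.

```python
# Standard recognition sequences (an addition, not stated in the text).
SITES = {
    "EcoRI": "GAATTC",
    "SphI": "GCATGC",
    "NcoI": "CCATGG",
    "SalI": "GTCGAC",
}

# Primer sequences copied from the description; the names are hypothetical.
ADAPTER = "AGAATTCGCATGCCATGGTCGACG"
ADAPTER_OVERLAPPING = "ATTCGCATGCCATGGTCGACGAAG"
INTERNAL_SECOND = "TCCAGGGCACTAATGTCAAACACG"
INTERNAL_THIRD = "ACTAATGTCAAACACGTACCTCTG"

def find_sites(seq):
    """Map each enzyme to the 0-based offset of its site, or -1 if absent."""
    return {name: seq.find(site) for name, site in SITES.items()}

def overlap(left, right):
    """Longest suffix of `left` that is also a prefix of `right`."""
    for size in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:size]):
            return right[:size]
    return ""

adapter_sites = find_sites(ADAPTER)
adapter_overlap = overlap(ADAPTER, ADAPTER_OVERLAPPING)
internal_overlap = overlap(INTERNAL_SECOND, INTERNAL_THIRD)

print(adapter_sites)
print(len(adapter_overlap), len(internal_overlap))
```

In this sketch the two adapter primers share 21 nucleotides and the two internal primers share 16, which matches the description of each reamplification primer as "overlapping" its predecessor.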

In addition a cDNA bank - produced from RNA of human
fibroblasts and cloned into λgt10 - was screened. In
this process 2 x 10^6 phages were tested using a ca. 1 kb
fragment of genomic MP-52 DNA (2nd exon up to the
HindIII restriction site in the 3' untranslated region)
as a radioactive probe. 17 mixed plaques were picked out
and were checked by PCR using primers from the 5' and 3'
region of the MP-52 sequence. Subsequently 8 phage
plaques were selected and isolated. cDNA was isolated
from the phage by an EcoRI partial cleavage and cloned
into the Bluescript vector that was also cleaved with
EcoRI.

Sequencing of one of the resulting plasmids 
SK52L15.1MP25 showed that the longest phage (15.1)
starts at nucleotide No. 321 of SEQ ID NO. 1. In 
addition the splicing position (nucleotide 1270) was 
confirmed by the sequencing. 

The plasmid SKL 52 (H3) MP12 was deposited on 10th 
December 1992 at the DSM ("Deutsche Sammlung von
Mikroorganismen und Zellkulturen", Mascheroder Weg 1b,
38124 Braunschweig) under the depository number 7353. 

The phage λ 2.7.4 was deposited on 13th January 1993 at
the DSM under the depository number 7387. 

The plasmid SK52L15.1MP25 was deposited on 16th July
1993 at the DSM under the depository number 8421. 

Example 2 
Expression of MP52 

Various systems were checked for the expression of MP52.
The use of Vaccinia viruses as an expression system is
described in detail, in a manner that a person skilled
in the art can reproduce, in Current Protocols in
Molecular Biology (Ausubel et al., Greene Publishing
Associates and Wiley-Interscience, Wiley & Sons),
abbreviated CP in the following, in chapter 16, units
16.15-16.18. The system is based on the fact that
foreign DNA can be integrated by homologous 
recombination into the genome of the Vaccinia virus 
using certain vectors. For this purpose the vector used 
contains the TK (thymidine kinase) gene from the
Vaccinia genome. In order to enable selection for 
recombinant viruses the vector in addition contains the 
E. coli xanthine-guanine phosphoribosyl transferase gene
(gpt) (Falkner et al., J. Virol. 62 (1988), 1849-1854). 
The cDNA with the entire region coding for MP52 was 
cloned into this vector. The cDNA comes from plasmid 
SK52L15.1MP25 (DSM, depository number 8421) which was,
however, firstly deleted and subcloned in order to
remove a large portion of the 5' untranslated region.
For this the plasmid SK52L15.1MP25 was linearized with
SalI and the 5' end was deleted stepwise with the
ExoIII/mung bean kit (Stratagene #200330) according to 
the manufacturer's instructions. After restriction with 
BamHI, the MP52 cDNAs that had been deleted to different 
extents were separated from the residual vector and 
isolated by an agarose gel and subcloned (pSK52s) 
according to standard methods (Sambrook et al.,
Molecular Cloning, second edition, Cold Spring Harbor
Laboratory Press 1989) in a pBluescriptII SK vector
(Stratagene #212206) restricted with EcoRV and BamHI.
All restrictions were carried out according to the 
manufacturer's instructions. Sequencing with Sequenase 
(USB/Amersham #70770) yielded inter alia a clone which 
starts with nucleotide 576 in SEQ ID NO. 1 (64 base 
pairs distant from the start codon). The cDNA insert was
isolated from this by restriction with SalI and SacI and
cloned into the likewise cleaved vector for 
recombination in the vaccinia virus system. The 
resulting plasmid (pBPlMP52s) was deposited on 24th May 
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1994 at the DSM (deposit number 9217) and used for the 
production of recombinant Vaccinia viruses. For this up 
to 80 % confluent 143B cells (HuTk, ATCC CRL 8303) in 35 
mm culture dishes were infected with Vaccinia wild-type 
virus in 2 ml PBS for 30 minutes at room temperature 
while shaking occasionally (1 virus per 10 cells) . After 
aspirating the supernatant and adding 2 ml culture 
medium (MEM, Gibco BRL #041-01095) , it was incubated for 
2 hours at 37 °C. The medium was subsequently removed and 
transformation of these cells was achieved with 100 ng 
pBPlMP52s, 2 jug carrier DNA (calf thymus, Boehringer 
Mannheim #104175) and 10 fil Lipofectin (Gibco BRL # 
18292-011) in 1 ml MEM for 15 hours at 37 °C. After 
addition of 1 ml MEM containing 20 % FCS (Gibco BRL 
#011-06290) , they were incubated for a further 24 hours 
at 37 °C and subsequently the lysed cells were frozen. 

The gpt selection for xanthine-guanine phosphoribosyl 
transferase and isolation and amplification of 
individual recombinant viruses was essentially carried 
out as described in unit 16.17 of CP except that RK13 
cells (ATCC CCL 37) were used. 

Integration of MP52 cDNA into the virus genome was 
confirmed by dot blot and Southern blot analysis (CP 
unit 16.18). A recombinant virus was used for expression 
analyses in the cell line 143B (HuTk-, ATCC CRL 8303, 
human). Confluent cells were infected for 45 minutes at
37 °C with a number of viruses corresponding to the
number of cells, and subsequently the respective
culture medium (MEM, Gibco BRL #041-01095) containing
10 % FCS and penicillin/streptomycin (1:500, Gibco BRL
#043-05140H) was added. After 6 hours at 37 °C, the medium was
removed, the cells were washed twice with e.g. HBSS
(Gibco BRL #042-04180M) and production medium (e.g. MEM)
without FCS was added. After 20 to 22 hours of
production the cell supernatant was collected. The 
expression was analysed by means of Western blots 
according to standard methods (CP unit 10.8). For this 
the proteins from 100 to 500 µl cell culture supernatant
were precipitated by addition of an equivalent volume of
acetone and incubation for at least one hour on ice, and
centrifuged. After resuspending the pellet in
application buffer (7 M urea, 1 % SDS, 7 mM sodium
dihydrogen phosphate, 0.01 % bromophenol blue and if
desired 1 % β-mercaptoethanol) separation was carried
out in 15 % polyacrylamide gels. A prestained protein
molecular weight standard (Gibco BRL #26041-020) was
used as the marker. Transfer onto a PVDF
membrane (Immobilon #IPVH00010) and blocking of the
membrane were carried out according to standard methods.

In order to detect MP52 on the membrane, polyclonal 
antibodies against MP52 had been produced in chickens as 
well as in rabbits. For this the mature part of MP52 
with 6 histidines at the N-terminus was expressed in E. 
coli and purified as described for example in Hochuli et 
al. (BIO/Technology, Vol. 6, 1321-1325 (1988)). Both 
antibodies enable the specific detection of expression
of MP52, although dimeric MP52 is recognized less
efficiently than monomeric MP52. The chicken antibodies used
for the Western blot in Figure 3 had been
specifically purified by means of PEG precipitation
(Thalley et al., BIO/Technology vol. 8, 934-938 (1990))
and by means of membrane-bound antigen (mature MP52
containing 6 histidines) (18.17 in Sambrook et al.,
Molecular Cloning, second edition, Cold Spring Harbor
Laboratory Press 1989). Anti-chicken IgG with coupled
alkaline phosphatase (Sigma A9171) was used as the
second antibody. The detection was carried out using the
Tropix Western-Light protein detection kit (Serva
#WL10RC) according to the manufacturer's instructions.

The Western blot in Figure 3 shows that MP52-specific
bands only occur with the recombinant viruses but not
with the wild-type viruses (without integrated foreign
DNA). The expression of MP52 results in a secreted
protein having an apparent molecular weight of about 25 
kDa in the gel under non-reducing conditions. The 
protein migrates in the gel at 14 to 15 kDa under 
reducing conditions. These results show that MP52 is 
expressed as a dimeric mature protein. The weak bands 
appearing in the region above 60 kDa that occur in the 
Western blot are probably residues of the uncleaved 
precursor proteins. The migration properties also
confirm the theoretical molecular weights that can be
derived from SEQ ID NO. 2 according to which mature, 
monomeric MP52 has a size of 13.6 kDa. 
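
The dimer interpretation can be checked with simple arithmetic. The following is a minimal sketch only: the function name is illustrative and not part of the original disclosure, and apparent gel masses are approximate, so the result is a plausibility check rather than a measurement.

```python
def estimated_subunits(apparent_nonreduced_kda: float, monomer_kda: float) -> int:
    """Estimate the subunit count from the apparent non-reduced mass
    and the theoretical monomer mass (rough plausibility check)."""
    return round(apparent_nonreduced_kda / monomer_kda)


MONOMER_KDA = 13.6      # theoretical mature monomer derived from SEQ ID NO. 2
NONREDUCED_KDA = 25.0   # apparent mass in the gel under non-reducing conditions

subunits = estimated_subunits(NONREDUCED_KDA, MONOMER_KDA)
theoretical_dimer_kda = 2 * MONOMER_KDA

print(f"estimated subunits: {subunits}")
print(f"theoretical dimer mass: {theoretical_dimer_kda:.1f} kDa")
```

The theoretical dimer mass lies close to the band observed under non-reducing conditions, while the band at 14 to 15 kDa under reducing conditions lies close to the monomer mass.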

It proved possible to express MP52 and to
cleave the precursor protein to mature MP52 in various
cell lines. C127 (ATCC CRL 1616, mouse), BHK21 (ATCC CCL
10, hamster), MRC-5 (ATCC CCL 171, human) and 3T6-Swiss
albino (ATCC CCL 96, mouse) cells were tested.

Expression and cleavage to form mature MP52 was also 
demonstrated in a further eukaryotic expression system. 
For this cDNA from MP52 (starting with nucleotide 576)
was cloned into the expression plasmid pSG5 (Stratagene
#216201). The plasmid pSK52s was restricted with ClaI
and XbaI and the protruding ends of the MP52 insert were
made blunt by T4 polymerase treatment. Cloning into the
vector pSG5, which had been restricted with EcoRI and
likewise blunt-ended by T4 polymerase treatment, was
carried out according to standard methods. All enzymatic
reactions were carried out according to the instructions
of the manufacturer. Correct orientation of the MP52
insert was ensured by restriction analysis and
sequencing with the T7 primer (Stratagene #300302). The
resulting plasmid pSG52s (deposited on 17.05.94 at the
DSM with the deposit number DSM 9204) can be
cotransformed with a vector that codes for a selectable
marker such as e.g. the gene for G418 resistance in
order to obtain stable cell lines. For this purpose
pSG52s was cotransformed with the plasmid p3616
(deposited on 17.05.94 at the DSM with the deposit
number DSM 9203) in L929 cells (ATCC CCL1, mouse) using
Lipofectin (Gibco BRL #18292-011) according to the 
manufacturer's instructions. Selection with G418 was 
carried out according to methods familiar to a person 
skilled in the art (CP, unit 9.5) and it resulted in a 
cell line that produced detectable mature MP52 in the 
Western blot. 

A further expression vector for MP52 was produced using
the plasmid pABWN (Niwa et al., Gene 108 (1991), 193-200
and Figure 4) which was provided by Dr. Miyazaki.

For this the HindIII fragment from plasmid pSK52s that
starts with nucleotide 576 in SEQ ID NO. 1 was isolated
and the protruding ends were made blunt by treatment
with Klenow fragment. A NotI restriction cleavage site
was introduced at both ends of the fragment by ligation
of the adapter.
Adapter: AGCGGCCGCT
         TCGCCGGCGA

Vector pABWN was restricted with XhoI, also treated with
the Klenow fragment and dephosphorylated with calf
intestinal alkaline phosphatase (Boehringer
Mannheim). The same phosphorylated adapter was ligated
on so that an insertion of the MP52 fragment after
restriction with NotI into the generated NotI cleavage
site of the vector was now possible. The expression
vector that results is subsequently denoted
HindIII-MP52/pABWN. All the reactions carried out for the
cloning were performed according to standard methods
(e.g. CP unit 3.16).

The structure of the HindIII-MP52/pABWN expression 
vector was confirmed by sequencing and restriction 
mapping. HindIII-MP52/pABWN contains the MP52 sequence 
starting with nucleotide 576 and ending with nucleotide 
2278 in SEQ ID NO. 1. 
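
The coordinates quoted for the insert can be turned into lengths with inclusive counting. The sketch below is illustrative only: the names are hypothetical, and it assumes that the "64 base pairs distant from the start codon" stated earlier is counted from nucleotide 576 to the first base of the start codon.

```python
CLONE_START = 576              # first nucleotide of the shortened cDNA in SEQ ID NO. 1
INSERT_END = 2278              # last nucleotide contained in HindIII-MP52/pABWN
DISTANCE_TO_START_CODON = 64   # distance stated in the text (counting convention assumed)


def inclusive_length(first: int, last: int) -> int:
    """Number of nucleotides from first to last, both ends included."""
    return last - first + 1


# Assumption: the start codon begins 64 nucleotides downstream of position 576.
start_codon_pos = CLONE_START + DISTANCE_TO_START_CODON
insert_length = inclusive_length(CLONE_START, INSERT_END)

print(f"assumed start codon position: {start_codon_pos}")
print(f"insert length: {insert_length} nucleotides")
```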

HindIII-MP52/pABWN was transfected into L cells (mouse
fibroblasts) and stable transformants were established
therefrom.

For this 4 µg in each case of the plasmids
(HindIII-MP52/pABWN or pABWN) were transfected into 5 x 10^5 L cells
in a 6 cm culture dish using 20 µl LipofectAMINE reagent
(Gibco BRL #18324-012). For this solution A (4 µg of the
respective DNA in 200 µl OPTI-MEM I (Gibco BRL #31985))
was carefully mixed with solution B (20 µl LipofectAMINE
reagent in 200 µl OPTI-MEM I) and incubated for 45
minutes at room temperature to form the DNA liposome
complex. In the meantime the cells were washed
once with 2 ml OPTI-MEM I. For each transfection, 1.6 ml
OPTI-MEM I was added to the vessel with the DNA liposome
complex. The solution was carefully mixed and the washed
cells were overlaid therewith. The cells were incubated
with the dilute complex for 5 hours at 37 °C in a CO2
incubator. After the incubation 2 ml DMEM (Gibco BRL,
Dulbecco's modified Eagle medium)/20 % FCS was added.
24 hours after the transfection, the medium was replaced
with fresh DMEM/10 % FCS. 48 hours after the start of
transfection, the cells were transferred into a 10 cm
culture dish. 72 hours after the start of the
transfection, the G418 selection was started at a
concentration of 800 µg/ml. The stable clones appeared
after 1 to 2 weeks. 
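
The volumes in this protocol can be cross-checked with a volume-weighted mixing calculation. This is a minimal sketch under stated assumptions: the helper name is hypothetical, the small volume of LipofectAMINE reagent is neglected, and OPTI-MEM I is treated as containing no FCS.

```python
def pooled_percent(parts):
    """Volume-weighted mean concentration (% v/v) of pooled solutions.

    parts: list of (volume_ml, percent) tuples.
    """
    total_ml = sum(volume for volume, _ in parts)
    return sum(volume * percent for volume, percent in parts) / total_ml


# Solution A (200 µl) + solution B (200 µl) + 1.6 ml OPTI-MEM I
complex_volume_ml = 0.2 + 0.2 + 1.6

# 4 µg plasmid DNA in the diluted complex
dna_ug_per_ml = 4.0 / complex_volume_ml

# After 5 hours, 2 ml DMEM/20 % FCS is added to the serum-free complex (assumed 0 % FCS)
fcs_percent_after_addition = pooled_percent([(complex_volume_ml, 0.0), (2.0, 20.0)])

print(f"complex volume: {complex_volume_ml:.1f} ml")
print(f"DNA concentration: {dna_ug_per_ml:.1f} µg/ml")
print(f"FCS after addition: {fcs_percent_after_addition:.1f} %")
```

Under these assumptions the addition of medium containing 20 % FCS brings the dish to the same 10 % FCS that is used for the subsequent medium change.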

5 ml conditioned DMEM with or without FCS was obtained
from confluent transformants which had been grown for 3
days in a 10 cm culture dish. The two different cell
culture supernatants (HindIII-MP52/pABWN and pABWN) of
transfected cells as well as cell lysates were examined
by Western blot. In this process mature MP52 was found
in conditioned medium as well as in cell lysates of
cells transfected with HindIII-MP52/pABWN. The clones
were further cloned and cells producing MP52 were each
selected after Western blot analysis. Estimations from
Western blot analyses yielded an MP52 production of up
to 1 mg/l.

Example 3 : 

Biological activity of MP52

Several experiments were carried out in vitro and in 
vivo in order to detect the biological activity of MP52 
and to prove the usefulness of this invention for 
medical applications for the prevention and/or treatment 
of bone diseases. 






1. In vitro assays 



1.1

Since an increase in glycosaminoglycan (GAG) synthesis 
is described in chondrocytes after stimulation with
TGF-β (Hiraki et al., Biochimica et Biophysica Acta 969
(1988), 91-99), it was examined whether MP52 also has
this influence. The chondrogenic activity of MP52 was
tested in primary cultures from foetal rat extremities
using the cell culture supernatants (DMEM containing
10 % FCS) of L cell transformants producing MP52
(transfected with HindIII-MP52/pABWN).

The four extremities of 16-day-old rat foetuses were 
used for this. After trypsinization, the cells obtained in
F-12 medium (Nutrient mixture Ham's F-12, Gibco BRL
#21700) containing 10 % FCS were plated out at 3 x 10^5
cells on 24-well plates coated with collagen type I and
cultured for ca. 2 days until confluence. 56 µl
conditioned medium (CM) of HindIII-MP52/pABWN-L cell
transfectants, of pABWN-L cell transfectants or only
medium (DMEM containing 10 % FCS) was added to 500 µl
culture medium in each case (F-12 medium containing 10 %
FCS). F-12 medium containing 10 % FCS as well as the
respective additives was used over a period of 0, 3, 6
and 9 days. The medium containing the respective
additives was exchanged every three days. Afterwards the
culture was cultured for a further 2 days in F-12 medium
without FCS in the presence of the respective additives
(conditioned medium or control medium) and then 35S
sulfate was added for 6 hours. 35S incorporated into
polysaccharides was measured after pronase E digestion
and precipitation as described in Hiraki et al.
(Biochimica et Biophysica Acta 969 (1988), 91-99).



Table 1:

Radioactivity (cpm/well)

Number of days   DMEM (10 % FCS)    CM from pABWN-L      CM from HindIII-MP52/pABWN-L
of incubation    from control       cell transfectants   cell transfectants
                 L cells

                 3720±114           3865±120             4879±422
                 4188±135           4154±29              8223±275*
                 3546±160           3310±115             9890±1260*
                 3679±218           3633±167             7520±160*

Values relate to ± S.E.M. for 3 or 4 culture mixtures
*: p<0.01 vs DMEM and CM from pABWN-L cell transfectants
(Scheffe's multiple t-test)



As shown in Table 1, the cell culture supernatants of 
the transfectants producing MP52 significantly stimulate 
GAG synthesis in comparison to pure culture medium (DMEM 
containing 10 % FCS) or to the cell culture supernatant 
from L cells transfected with pABWN. This shows that 
MP52 can stimulate differentiation of chondrocytes. 
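
The size of the effect in Table 1 can be expressed as a fold change over the vector control. The sketch below is illustrative only: the variable and function names are not part of the original, the values are the mean cpm/well from Table 1 in the row order printed there, and the S.E.M. values are ignored, so no statistical statement is made.

```python
# Mean cpm/well from Table 1, rows in the order printed in the table.
dmem_control = [3720, 4188, 3546, 3679]
pabwn_cm = [3865, 4154, 3310, 3633]
mp52_cm = [4879, 8223, 9890, 7520]


def fold_change(treated, control):
    """Element-wise ratio of treated to control means."""
    return [t / c for t, c in zip(treated, control)]


fold_vs_pabwn = fold_change(mp52_cm, pabwn_cm)
fold_vs_dmem = fold_change(mp52_cm, dmem_control)

for row, (vs_vector, vs_medium) in enumerate(zip(fold_vs_pabwn, fold_vs_dmem), start=1):
    print(f"row {row}: {vs_vector:.2f}-fold vs pABWN CM, {vs_medium:.2f}-fold vs DMEM")
```

On these means the stimulation relative to the pABWN control is largest in the third row of the table, at roughly threefold.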



1.2 



An effect which has been described for some members of 
the BMP family is the stimulation of alkaline 
phosphatase (ALP) activity in osteoblasts. The clonal 
rat cell line ROB-C26 (C-26) belongs to the osteoblasts at
a relatively early stage of maturation (Yamaguchi et al.,
Calcif. Tissue Int. 49 (1991), 221-225). The capability
of stimulating ALP activity is described for 
osteoinductive proteins such as e.g. BMP-2 by Yamaguchi 
et al. (J. Cell Biol. 113 (1991), 681-687). 

The influence of MP52 on C26 cells was examined as
follows: C26 cells were plated out at 3 x 10^4 cells per
well in a 24-well plate and cultured in α-MEM (Gibco
BRL)/10 % FCS until confluence. 56 µl of the cell
culture supernatant from L cell transfectants producing
MP52 (HindIII-MP52/pABWN) or of the cell culture
supernatant from pABWN-L cell transfectants or only of
the cell culture supernatant (DMEM containing 10 % FCS)
from L cells was added to 500 µl of the C-26 cell
culture medium. A change of medium with the respective
additives was carried out every three days. The ALP
activity in the cell extracts was determined after 0, 3,
6, 9 and 12 days with the aid of standard techniques
based on p-nitrophenyl phosphate as the substrate as
described for example by Takuwa et al. (Am. J. Physiol.
257 (1989), E797-E803).



Table 2:

ALP activity (nmol/min) per well

Number of        DMEM (10 % FCS)    CM from pABWN-L      CM from HindIII-MP52/pABWN-L
incubation days  from control       cell transfectants   cell transfectants
                 L cells

 0               41.8±2.8
 3               136.3±3.7          125.8±2.3            181.3±14.2*
 6               129.0±7.8          119.3±6.4            258.0±8.3*
 9               118.4±3.7          110.1±2.8            258.4±10.6*
12               121.2±3.2          125.3±6.0            237.8±11.0*

Values relate to ± S.D. for 4 culture mixtures.
*: p<0.01 vs DMEM and CM from pABWN-L cell transfectants
(Scheffe's multiple t-test)



As shown in Table 2, addition of MP52 leads to a
significant increase in ALP activity compared to pure
DMEM/10 % FCS medium and medium from pABWN-transfected L
cells. This result shows that MP52 can not only cause
chondrocytes to differentiate but can also lead to the
differentiation and maturation of osteoblasts.



A further osteoblast cell line (MC3T3-E1, mouse), which
shows an increase in ALP activity on treatment with
BMP-2 as described by Takuwa et al. (Biochem. Biophys.
Res. Com. 174 (1991), 96-101), shows no
change in ALP activity after incubation with
conditioned medium from L cell transfectants producing
MP52 (HindIII-MP52/pABWN) or with medium after MP52
production by infection with recombinant Vaccinia
viruses. This indicates that MP52 has a cell specificity
that partially deviates from that of BMP-2. Different
functions due to different target sites for the
individual TGF-β family members may be of great medical
relevance.

2. In vivo experiments 
2.1 

The most definitive possibility of examining bone 
development is based on ectopic bone formation in vivo. 
This can for example be induced by implantation of 
demineralized bone matrix (Urist, Science 150 (1965),
893-899). The same process can be induced by combination
of inactive matrix with bone-inducing proteins as
described for example by Sampath et al. (PNAS,
Proc. Natl. Acad. Sci. USA 78 (1981), 7599-7603). This process
of bone formation is similar to embryonic enchondral
bone formation and adult bone healing. This method
therefore enables proteins to be examined for their
bone-inductive capability in vivo.

MP52 protein which had been obtained by expression in 
the Vaccinia system (see example 2) was partially 
purified and implanted for such an experiment. 

For this 143B cells (HuTk-, ATCC CRL 8303) were cultured
in culture dishes and roller flasks until confluence and
infected with recombinant viruses as described in
example 2 for the expression analyses; they were washed
and MP52 was allowed to accumulate for about 20 hours in
MEM (Gibco BRL, ca. 1 ml per 10^6 cells). An identical
preparation was infected with wild-type viruses as a
control. Cell culture supernatant (conditioned medium)
from each preparation was collected and centrifuged
(40000 x g for 30 minutes at 4 °C). In order to remove
the viruses, the supernatants were filtered over
inorganic filters (0.1 µm pore size, Whatman, Anotop
25). In the course of the characterization of MP52 it
was shown that this protein binds to heparin-Sepharose.
This property was utilized for partial purification. For
this the filtered and centrifuged, conditioned medium
was brought to a final concentration of 50 mM Tris pH
7.0, 100 mM NaCl and 6 M urea and it was loaded onto a
heparin column (HiTrap™, Pharmacia #17-0407-01) that
was equilibrated in buffer A (50 mM Tris pH 7.0, 100 mM
NaCl and 6 M urea). The loaded column was washed with
buffer A and eluted with a linear gradient to 100 %
buffer B (50 mM Tris pH 7.0, 600 mM NaCl and 6 M urea)
at a flow rate of 0.5 ml/min within 50 min (2.5 ml per
fraction). The use of urea is not absolutely necessary.
MP52 elutes reproducibly mainly in 2 fractions at about
250 to 400 mM NaCl as could be shown by Western blot
analysis (see example 2). Aliquots of these fractions
were also examined in 15 % polyacrylamide gels stained
with silver according to the instructions of the
manufacturer (Silver Stain-II, Daiichi #SE140000) and
the fractions were pooled. The comparable fractions were
also pooled after analysis in gels stained with silver
after purification from conditioned medium after
infection with wild-type viruses. 
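
The heparin gradient described above fixes a nominal NaCl concentration for each collected fraction. The following sketch tabulates these values under idealized assumptions that are not stated in the original: fraction collection starts with the gradient, and column dead volume and mixing delay are ignored. All names are illustrative.

```python
FLOW_ML_PER_MIN = 0.5   # flow rate
GRADIENT_MIN = 50.0     # duration of the linear gradient
FRACTION_ML = 2.5       # volume per fraction
NACL_A_MM = 100.0       # NaCl in buffer A
NACL_B_MM = 600.0       # NaCl in buffer B

GRADIENT_ML = FLOW_ML_PER_MIN * GRADIENT_MIN  # total gradient volume


def nacl_mm(volume_ml: float) -> float:
    """Nominal NaCl concentration (mM) after volume_ml of the linear gradient."""
    fraction_of_gradient = min(max(volume_ml / GRADIENT_ML, 0.0), 1.0)
    return NACL_A_MM + (NACL_B_MM - NACL_A_MM) * fraction_of_gradient


n_fractions = int(GRADIENT_ML / FRACTION_ML)

# (fraction number, NaCl at start of fraction, NaCl at end of fraction)
fraction_ranges = [
    (i + 1, nacl_mm(i * FRACTION_ML), nacl_mm((i + 1) * FRACTION_ML))
    for i in range(n_fractions)
]

for number, start_mm, end_mm in fraction_ranges:
    print(f"fraction {number}: {start_mm:.0f} to {end_mm:.0f} mM NaCl")
```

Each fraction nominally spans 50 mM NaCl; on a real column the observed elution position is shifted by the system's delay volume.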

Further examinations on MP52 showed that MP52 also binds 
to hydroxyapatite. Thus it is in principle possible to
achieve an additional purification by a hydroxyapatite
column or to replace a heparin column by a
hydroxyapatite column (e.g. BIO-RAD, Econo-pac HTP).
Other methods known to a person skilled in the art are
also conceivable for further purifications such as e.g.
gel sieve columns, ion exchanger columns, affinity
columns, metal chelate columns or columns based on
hydrophobic interactions. 

The MP52 protein prepurified by heparin-Sepharose
chromatography, or the corresponding still contaminated
proteins which are also present in the cell
culture supernatants of cells infected with the wild-type virus, were
further purified by means of reversed phase HPLC. For
this a C8 column (Aquapore RP-300, Applied Biosystems,
particle size: 7 µm, pore size: 300 Å) was equilibrated
with 10 % buffer B (buffer A: 0.1 % trifluoroacetic
acid; buffer B: 90 % acetonitrile, 0.1 % trifluoroacetic
acid). After loading the column with the pooled
fractions containing MP52 from the heparin column it was
extensively washed with 10 % buffer B. The bound protein
was eluted with the following gradient: 10 to 50 %
buffer B for 20 minutes and 50 to 100 % buffer B for 50
minutes. Fractions of 500 µl were collected and analysed
by Western blot as well as with gels stained with
silver. The MP52 protein elutes under the selected
conditions in the range of about 55 to 65 %
acetonitrile. The fractions containing MP52 were pooled.
The same procedure was carried out with the
corresponding fractions from the control purification of
cell culture supernatant from cells infected with
wild-type viruses.
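
The two-step HPLC gradient can be converted into a nominal acetonitrile concentration over time. This is a sketch under assumptions that go beyond the text: both steps are treated as linear ramps, buffer A is taken to contain no acetonitrile, and gradient delay is ignored. The function names are illustrative.

```python
ACN_IN_BUFFER_B = 90.0  # % acetonitrile in buffer B; buffer A assumed to contain none


def percent_b(t_min: float) -> float:
    """Nominal % buffer B at time t_min after the start of the gradient."""
    if t_min <= 0.0:
        return 10.0
    if t_min <= 20.0:                      # step 1: 10 to 50 % B in 20 minutes
        return 10.0 + 40.0 * t_min / 20.0
    if t_min <= 70.0:                      # step 2: 50 to 100 % B in 50 minutes
        return 50.0 + 50.0 * (t_min - 20.0) / 50.0
    return 100.0


def percent_acetonitrile(t_min: float) -> float:
    """Nominal % acetonitrile in the eluent at time t_min."""
    return percent_b(t_min) * ACN_IN_BUFFER_B / 100.0


def time_for_acetonitrile(acn_percent: float) -> float:
    """Time (min) at which the nominal gradient reaches acn_percent (9 to 90 %)."""
    b = acn_percent * 100.0 / ACN_IN_BUFFER_B
    if b <= 50.0:
        return (b - 10.0) * 20.0 / 40.0
    return 20.0 + (b - 50.0) * 50.0 / 50.0


window_start = time_for_acetonitrile(55.0)
window_end = time_for_acetonitrile(65.0)
print(f"55 to 65 % acetonitrile: {window_start:.1f} to {window_end:.1f} min")
```

Under these assumptions the stated elution range of 55 to 65 % acetonitrile falls within the second gradient step.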

Partially purified MP52 protein at a concentration
estimated to be 50 ng/ml according to Western blot
analysis also showed a significant increase in the ALP
activity in ROB-C26 cells after three days of
incubation.

Partially purified MP52 protein or control protein from
the corresponding partially purified cell culture
supernatants after infection with wild-type viruses was
reconstituted with matrix and implanted in rats in order
to demonstrate the capability of MP52 for cartilage and bone
formation.

In principle various matrix materials known to a person 
skilled in the art should be usable, i.e. natural (also
modified) and synthetically prepared matrices; however,
biocompatible porous materials that can be biologically 
degraded are preferred. In these experiments bone matrix 
from rats was used that had been prepared essentially in 
a similar way to that described by Sampath et al. (PNAS 
80 (1983), 6591-6595). The rat bones (femur and tibia) 
were demineralized in 0.6 M HCl for 24 hours and 
subsequently bone marrow that was still present was 
removed. After washing with water and defatting for 
three hours in a chloroform/methanol (1/1) mixture, the
bones were air-dried, pulverized in a mill in a
deep-frozen state and particle sizes between 400 and 1000 µm
were sieved out. Subsequently the matrix was extracted
for 7 days at room temperature in 4 M guanidinium HCl in 
the presence of protease inhibitors. After washing 
extensively with water, the matrix was lyophilized and 
stored at 4°C. Matrices treated in this way do not on 
their own show bone-inducing activity. 

Protein can be combined with the extracted bone matrix 
by various methods known to a person skilled in the art.
MP52 protein or control protein that had been purified
by means of heparin-Sepharose as well as by reversed
phase HPLC was combined, after elution in the
acetonitrile/trifluoroacetic acid solution, with 25 mg
matrix in each case per implant, mixed well, deep-frozen 
and lyophilized. 

For the implantation of matrix-bound MP52, two ca.
3-month-old rats (Wistar) were used which had been
anaesthetised by intramuscular injection of an
anaesthetic (0.2 ml Rompun (Bayer) mixed with 0.5 ml
Ketanest 50 (Parke Davis)) using 0.14 ml per 100 g body
weight. Bilateral pockets were prepared in the abdominal
muscles for the implants (beneath the thorax, starting
ca. 0.5 cm below the lowest costal arch). The
matrix-bound MP52 (ca. 2 to 4 µg as estimated by Western blot)
as well as the corresponding matrix-bound control
proteins were moistened using 0.9 % saline solution
(Delta Pharma) and introduced into the muscular pockets.
The muscular pockets as well as the necessary skin
incisions were subsequently sutured. The rats were
immunosuppressed with Cyclosporin A (Sandimmun).

After 18 or after 26 days the implants were removed from
the rats and fixed for histological examinations. Since
after 26 days the implant with MP52 suggested
macroscopically that bone formation had
already occurred, it was embedded in
methylmethacrylate in order to prepare thin sections; the other
implants were embedded in paraffin. Mineralized
cartilage and bone tissues are accentuated in black by
means of the von Kossa staining technique (Romeis, B.;
"Mikroskopische Technik", Ed: Bock, P.; Urban and
Schwarzenberg; Munich, Baltimore, Vienna (1989)). In the
trichrome staining according to Masson-Goldner
(Romeis, B.; "Mikroskopische Technik", Ed: Bock, P.;
Urban and Schwarzenberg; Munich, Baltimore, Vienna
(1989)), mineralized bone tissue and collagen are
stained bright green, osteoid is stained red and
cytoplasm reddish-brown. Both staining techniques were
used on implants from both rats. In both experimental
animals considerable formation of cartilage and bone was
detected in the implants containing MP52 using both
staining techniques. The corresponding implants with
control protein showed no formation whatsoever of
cartilage or bone. The number of cartilage precursors
with chondrocytes and cartilage areas with initial
formation of extracellular matrix and its mineralization
in concentric circles is higher in the MP52 implant
after 18 days than in the one after 26 days. Mature bone
tissue with vectorial osteoid formation as well as
individual osteocytes in the bone is, however, also
detectable in the implant after 18 days. In addition
closed ossicles can be observed with the onset of bone
marrow formation. In the implant after 26 days areas of
cartilage with initial matrix formation and
calcification are also detectable; the portion of
mineralized bone tissue stained green and having
osteocytes and osteoid edges has, however, significantly
increased. In this implant bone marrow formation
together with the occurrence of isolated fat cells can
also be detected. For illustration Figure 5 shows the
staining of the bone material (according to von
Kossa) from the entire implant after 26 days. A
small section of the same implant is shown in Figure 6
after staining according to Masson-Goldner. It shows
active bone with an edging of cuboidal osteoblasts and
osteoid in which individual embedded osteoblasts can be
recognized. Furthermore individual osteocytes can also
be seen in the mineralized bone tissue (stained green in
the original preparation). The formation of bone marrow
is also detectable.



The experiment shows that recombinantly produced MP52
alone, in combination with a matrix, is capable of
inducing enchondral bone formation.

2.2 

In order to confirm the results, a further ectopic test 
for bone formation using MP52 L cell transformants was
carried out. L cells (1 x 10^6 cells) producing MP52
(transfected with HindIII-MP52/pABWN) and non-producing
(pABWN-transfected) L cells were injected into the
bilateral thigh muscles of three male nude mice in each
case. All animals were killed after three weeks, the
thigh muscles were excised and these were examined with
low energy X-ray radiation as well as
histopathologically.

Analysis by X-ray radiation shows dense material in the
muscle tissue at all injection sites of L cells
producing MP52, as listed in Table 3. Simple cartilage
formation and calcified cartilage formation could be
determined in the muscles using histological
examinations. These results also confirm that MP52 can
induce enchondral bone formation.



Table 3:

                        dense material      chondrocytes     calcified cartilage
                        by X-ray analysis   by histology     formation by histology

Cells producing MP52    3/3                 3/3
(HindIII-MP52/pABWN)

Control cells           0/3                 0/3
(pABWN)



The experiments that were carried out confirm that MP52 
protein stimulates the formation of cartilage from 
undifferentiated mesenchymal cells as well as the 
differentiation and maturation of osteoblasts. This 
leads to enchondral bone formation which is similar to 
the induction cascade in embryonic bone formation and 
bone healing of fractures. 



The conditions stated in the experiments are to be 
looked upon as an illustration of the MP52 activity but 
not as limitation. The invention can also be examined 
and characterized in another form. 



CLAIMS 



1. DNA molecule that codes for a protein of the TGF-β
family and which comprises

(a) the part coding for the mature protein and if
desired further functional parts of the nucleotide
sequence shown in SEQ ID NO. 1,

(b) a nucleotide sequence corresponding to the
sequence from (a) within the scope of the
degeneracy of the genetic code,

(c) a nucleotide sequence corresponding to an
allelic derivative of one of the sequences from (a)
and (b), or

(d) a nucleotide sequence hybridizing with one of
the sequences from (a), (b) or (c)

provided that a DNA molecule according to (d)
completely contains at least the part coding for a
mature protein of the TGF-β family.

2. Vector, 
wherein 

it contains at least one copy of a DNA molecule as 
claimed in claim 1. 

3. Host cell, 
wherein 

it is transformed by a DNA as claimed in claim 1 or 
by a vector as claimed in claim 2. 



4. Host cell as claimed in claim 3,
wherein

it is a bacterium, a fungus, a plant or an animal
cell.

5. Protein of the TGF-β family which is coded by a DNA
sequence as claimed in claim 1.

6. Protein as claimed in claim 5,
wherein

it has the amino acid sequence shown in SEQ ID
NO. 2 or, if desired, functional parts thereof.

7. Process for the production of a protein of the
TGF-β family,
wherein

a host cell as claimed in claim 3 or 4 is cultured
and the TGF-β protein is isolated from the cell
and/or from the culture supernatant.

8. Pharmaceutical composition,
wherein

it contains at least one protein as claimed in
claim 5 or 6 as the active substance, if desired
together with the usual pharmaceutical carrier
substances, auxiliary substances, diluents or
fillers.

9. Pharmaceutical composition as claimed in claim 8
for the treatment or prevention of damage to bone,
cartilage, connective tissues, skin, mucous
membranes, epithelium or teeth, for application in
dental implants and for application in
wound-healing and tissue regeneration processes.

10. Antibody or antibody fragments,
wherein

they bind to a protein as claimed in claim 5 or 6.



ABSTRACT 



The invention concerns a protein of the TGF-β family,
the DNA coding therefor and a pharmaceutical composition
containing the protein.

Figure 1
Figure 2a
Figure 2b

M: prestained protein molecular weight marker with the stated apparent
molecular weights listed (Gibco BRL #26041-020)

1: Cell culture supernatant (100 µl) after infection with recombinant viruses
(with inserted MP52 cDNA) under reducing (1 % β-mercaptoethanol)
conditions

2: Cell culture supernatant (100 µl) after infection with wild-type viruses
(without inserted foreign DNA) under reducing (1 % β-mercaptoethanol)
conditions

3: Cell culture supernatant (500 µl) after infection with recombinant viruses
(with inserted MP52 cDNA) under non-reducing conditions

4: Cell culture supernatant (500 µl) after infection with wild-type viruses
(without inserted foreign DNA) under non-reducing conditions




Figure 5 



Section through the entire implant (26 days after implantation) stained
according to von Kossa. Mineralized tissue in black is clearly distinguishable
from the surrounding muscle tissue.



Figure 6 



Cross-section of the implant (26 days after implantation) stained according
to Masson-Goldner.

1: border consisting of osteoblasts (pink in the original)
2: osteoid (red in the original)
3: mineralized bone tissue (green in the original) with osteocytes
(pink in the original)
4: bone marrow (slight pink to orange in the original)



Fig.2a 

                Eco RI Nco I
OD            ATGAATTCCCATGGACCTGGGCTGGMAKGAMTGGAT    SEQ ID NO 19

BMP 2                       ACGTGGGGTGGAATGACTGGAT    SEQ ID NO 20
BMP 3                       ATATTGGCTGGAGTGAATGGAT    SEQ ID NO 21
BMP 4                       ATGTGGGCTGGAATGACTGGAT    SEQ ID NO 22
BMP 7                       ACCTGGGCTGGCAGGACTGGAT    SEQ ID NO 23
TGF-β1                      AGGACCTCGGCTGGAAGTGGAT    SEQ ID NO 24
TGF-β2                      GGGATCTAGGGTGGAAATGGAT    SEQ ID NO 25
TGF-β3                      AGGATCTGGGCTGGAAGTGGGT    SEQ ID NO 26
INHIBIN α                   AGCTGGGCTGGGAACGGTGGAT    SEQ ID NO 27
INHIBIN βA                  ACATCGGCTGGAATGACTGGAT    SEQ ID NO 28
INHIBIN βB                  TCATCGGCTGGAACGACTGGAT    SEQ ID NO 29
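
The alignment in Fig. 2a can be checked mechanically. The following is a minimal sketch, not part of the original disclosure: it assumes that the letters M and K in primer OD are the standard IUPAC ambiguity codes (M = A or C, K = G or T) and that the last 22 nucleotides of OD are the portion aligned with the listed sequences. The names `IUPAC`, `OD_CORE`, `ALIGNED` and `mismatches` are illustrative only.

```python
# IUPAC nucleotide codes mapped to the bases each one may stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "M": "AC", "K": "GT", "S": "CG", "R": "AG",
    "W": "AT", "Y": "CT", "N": "ACGT",
}

# Primer OD (SEQ ID NO 19) without its 14-nt 5' tail carrying the
# Eco RI / Nco I sites (assumption: the remaining 22 nt are the aligned part).
OD_CORE = "ATGAATTCCCATGGACCTGGGCTGGMAKGAMTGGAT"[14:]

# Sequences as listed in Fig. 2a (SEQ ID NO 20 to 29).
ALIGNED = {
    "BMP 2": "ACGTGGGGTGGAATGACTGGAT",
    "BMP 3": "ATATTGGCTGGAGTGAATGGAT",
    "BMP 4": "ATGTGGGCTGGAATGACTGGAT",
    "BMP 7": "ACCTGGGCTGGCAGGACTGGAT",
    "TGF-beta1": "AGGACCTCGGCTGGAAGTGGAT",
    "TGF-beta2": "GGGATCTAGGGTGGAAATGGAT",
    "TGF-beta3": "AGGATCTGGGCTGGAAGTGGGT",
    "INHIBIN alpha": "AGCTGGGCTGGGAACGGTGGAT",
    "INHIBIN betaA": "ACATCGGCTGGAATGACTGGAT",
    "INHIBIN betaB": "TCATCGGCTGGAACGACTGGAT",
}


def mismatches(primer: str, target: str) -> int:
    """Count positions where the target base is not allowed by the primer code."""
    if len(primer) != len(target):
        raise ValueError("primer and target must have the same length")
    return sum(base not in IUPAC[code] for code, base in zip(primer, target))


if __name__ == "__main__":
    for name, seq in ALIGNED.items():
        print(f"{name:14s} {mismatches(OD_CORE, seq)} mismatch(es)")
```

A count of zero means the degenerate primer covers that sequence exactly; higher counts indicate positions where the primer would have to tolerate a mismatch.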



Fig.2b 

                Eco RI
OID           ATGAATTCGAGCTGCGTSGGSRCACAGCA    SEQ ID NO 30

BMP 2                 GAGTTCTGTCGGGACACAGCA    SEQ ID NO 31
BMP 3                 CATCTTTTCTGGTACACAGCA    SEQ ID NO 32
BMP 4                 CAGTTCAGTGGGCACACAACA    SEQ ID NO 33
BMP 7                 GAGCTGCGTGGGCGCACAGCA    SEQ ID NO 34
TGF-β1                CAGCGCCTGCGGCACGCAGCA    SEQ ID NO 35
TGF-β2                TAAATCTTGGGACACGCAGCA    SEQ ID NO 36
TGF-β3                CAGGTCCTGGGGCACGCAGCA    SEQ ID NO 37
INHIBIN α             CCCTGGGAGAGCAGCACAGCA    SEQ ID NO 38
INHIBIN βA            CAGCTTGGTGGGCACACAGCA    SEQ ID NO 39
INHIBIN βB            CAGCTTGGTGGGAATGCAGCA    SEQ ID NO 40



Fig.3 
Fig.4 
SEQ ID NO. 1 

TYPE OF SEQUENCE: nucleic acid sequence 
NAME AND ORIGIN: MP-52 DNA 
LENGTH: 2703 nucleotides 



[Nucleotide sequence illegible in the source scan; the legible 2703-nucleotide 
sequence appears under INFORMATION FOR SEQ ID NO:1.] 



SEQ ID NO: 2 

TYPE OF SEQUENCE: amino acid sequence 
NAME AND ORIGIN: MP-52 protein 
LENGTH: 501 amino acids 



MRLPKLLTFL LWYLAWLDLE FICTVLGAPD LGQRPQGTRP GLAKAEAKER 
PPLARNVFRP GGHSYGGGAT NANARAKGGT GQTGGLTQPK KDEPKKLPPR 
PGGPEPKPGH PPQTRQATAR TVTPKGQLPG GKAPPKAGSV PSSFLLKKAR 
EPGPPREPKE PFRPPPITPH EYMLSLYRTL SDADRKGGNS SVKLEAGLAN 
TITSFIDKGQ DDRGPVVRKQ RYVFDISALE KDGLLGAELR ILRKKPSDTA 
KPAAPGGGRA AQLKLSSCPS GRQPASLLDV RSVPGLDGSG WEVFDIWKLF 
RNFKNSAQLC LELEAWERGR AVDLRGLGFD RAARQVHEKA LFLVFGRTKK 
RDLFFNEIKA RSGQDDKTVY EYLFSQRRKR RAPLATRQGK RPSKNLKARC 
SRKALHVNFK DMGWDDWIIA PLEYEAFHCE GLCEFPLRSH LEPTNHAVIQ 
TLMNSMDPES TPPTCCVPTR LSPISILFID SANNVVYKQY EDMVVESCGC R 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2703 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

CCATGGCCTC GAAAGGGCAG CGGTGATTTT TTTCACATAA ATATATCGCA CTTAAATGAG 60 

TTTAGACAGC ATGACATCAG AGAGTAATTA AATTGGTTTG GGTTGGAATT CCGTTTCCAA 120 

TTCCTGAGTT CAGGTTTGTA AAAGATTTTT CTGAGCACCT GCAGGCCTGT GAGTGTGTGT 180 

GTGTGTGTGT GTGTGTGTGT GTGTGTGTGA AGTATTTTCA CTGGAAAGGA TTCAAAACTA 240 

GGGGGAAAAA AAAACTGGAG CACACAGGCA GCATTACGCC ATTCTTCCTT CTTGGAAAAA 300 

TCCCTCAGCC TTATACAAGC CTCCTTCAAG CCCTCAGTCA GTTGTGCAGG AGAAAGGGGG 360 

CGGTTGGCTT TCTCCTTTCA AGAACGAGTT ATTTTCAGCT GCTGACTGGA GACGGTGCAC 420 

GTCTGGATAC GAGAGCATTT CCACTATGGG ACTGGATACA AACACACACC CGGCAGACTT 480 

CAAGAGTCTC AGACTGAGGA GAAAGCCTTT CCTTCTGCTG CTACTGCTGC TGCCGCTGCT 540 

TTTGAAAGTC CACTCCTTTC ATGGTTTTTC CTGCCAAACC AGAGGCACCT TTGCTGCTGC 600 

CGCTGTTCTC TTTGGTGTCA TTCAGCGGCT GGCCAGAGGA TGAGACTCCC CAAACTCCTC 660 

ACTTTCTTGC TTTGGTACCT GGCTTGGCTG GACCTGGAAT TCATCTGCAC TGTGTTGGGT 720 

GCCCCTGACT TGGGCCAGAG ACCCCAGGGG ACCAGGCCAG GATTGGCCAA AGCAGAGGCC 780 

AAGGAGAGGC CCCCCCTGGC CCGGAACGTC TTCAGGCCAG GGGGTCACAG CTATGGTGGG 840 

GGGGCCACCA ATGCCAATGC CAGGGCAAAG GGAGGCACCG GGCAGACAGG AGGCCTGACA 900 

CAGCCCAAGA AGGATGAACC CAAAAAGCTG CCCCCCAGAC CGGGCGGCCC TGAACCCAAG 960 

CCAGGACACC CTCCCCAAAC AAGGCAGGCT ACAGCCCGGA CTGTGACCCC AAAAGGACAG 1020 

CTTCCCGGAG GCAAGGCACC CCCAAAAGCA GGATCTGTCC CCAGCTCCTT CCTGCTGAAG 1080 

AAGGCCAGGG AGCCCGGGCC CCCACGAGAG CCCAAGGAGC CGTTTCGCCC ACCCCCCATC 1140 

ACACCCCACG AGTACATGCT CTCGCTGTAC AGGACGCTGT CCGATGCTGA CAGAAAGGGA 1200 

GGCAACAGCA GCGTGAAGTT GGAGGCTGGC CTGGCCAACA CCATCACCAG CTTTATTGAC 1260 

AAAGGGCAAG ATGACCGAGG TCCCGTGGTC AGGAAGCAGA GGTACGTGTT TGACATTAGT 1320 

GCCCTGGAGA AGGATGGGCT GCTGGGGGCC GAGCTGCGGA TCTTGCGGAA GAAGCCCTCG 1380 

GACACGGCCA AGCCAGCGGC CCCCGGAGGC GGGCGGGCTG CCCAGCTGAA GCTGTCCAGC 1440 

TGCCCCAGCG GCCGGCAGCC GGCCTCCTTG CTGGATGTGC GCTCCGTGCC AGGCCTGGAC 1500 

GGATCTGGCT GGGAGGTGTT CGACATCTGG AAGCTCTTCC GAAACTTTAA GAACTCGGCC 1560 

CAGCTGTGCC TGGAGCTGGA GGCCTGGGAA CGGGGCAGGG CCGTGGACCT CCGTGGCCTG 1620 

GGCTTCGACC GCGCCGCCCG GCAGGTCCAC GAGAAGGCCC TGTTCCTGGT GTTTGGCCGC 1680 

ACCAAGAAAC GGGACCTGTT CTTTAATGAG ATTAAGGCCC GCTCTGGCCA GGACGATAAG 1740 

ACCGTGTATG AGTACCTGTT CAGCCAGCGG CGAAAACGGC GGGCCCCACT GGCCACTCGC 1800 

CAGGGCAAGC GACCCAGCAA GAACCTTAAG GCTCGCTGCA GTCGGAAGGC ACTGCATGTC 1860 

AACTTCAAGG ACATGGGCTG GGACGACTGG ATCATCGCAC CCCTTGAGTA CGAGGCTTTC 1920 

CACTGCGAGG GGCTGTGCGA GTTCCCATTG CGCTCCCACC TGGAGCCCAC GAATCATGCA 1980 

GTCATCCAGA CCCTGATGAA CTCCATGGAC CCCGAGTCCA CACCACCCAC CTGCTGTGTG 2040 

CCCACGCGGC TGAGTCCCAT CAGCATCCTC TTCATTGACT CTGCCAACAA CGTGGTGTAT 2100 

AAGCAGTATG AGGACATGGT CGTGGAGTCG TGTGGCTGCA GGTAGCAGCA CTGGCCCTCT 2160 

GTCTTCCTGG GTGGCACATC CCAAGAGCCC CTTCCTGCAC TCCTGGAATC ACAGAGGGGT 2220 

CAGGAAGCTG TGGCAGGAGC ATCTACACAG CTTGGGTGAA AGGGGATTCC AATAAGCTTG 2280 

CTCGCTCTCT GAGTGTGACT TGGGCTAAAG GCCCCCTTTT ATCCACAAGT TCCCCTGGCT 2340 

GAGGATTGCT GCCCGTCTGC TGATGTGACC AGTGGCAGGC ACAGGTCCAG GGAGACAGAC 2400 

TCTGAATGGG ACTGAGTCCC AGGAAACAGT GCTTTCCGAT GAGACTCAGC CCACCATTTC 2460 

TCCTCACCTG GGCCTTCTCA GCCTCTGGAC TCTCCTAAGC ACCTCTCAGG AGAGCCACAG 2520 

GTGCCACTGC CTCCTCAAAT CACATTTGTG CCTGGTGACT TCCTGTCCCT GGGACAGTTG 2580 

AGAAGCTGAC TGGGCAAGAG TGGGAGAGAA GAGGAGAGGG CTTGGATAGA GTTGAGGAGT 2640 

GTGAGGCTGT TAGACTGTTA GATTTAAATG TATATTGATG AGATAAAAAG CAAAACTGTG 2700 

CCT 2703 
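
The relationship between SEQ ID NO:1 and SEQ ID NO:2 can be illustrated with a short translation. This is a minimal sketch under stated assumptions, not part of the original disclosure: the excerpt is nucleotides 640 to 699 as read off the listing above (the offset is inferred from the listing, where an ATG followed by codons matching the start of SEQ ID NO:2 appears), and the standard genetic code is assumed. The names `ORF_START_EXCERPT` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Nucleotides 640-699 of SEQ ID NO:1 as read from the listing (assumed offset).
ORF_START_EXCERPT = (
    "ATGAGACTCC" "CCAAACTCCT" "CACTTTCTTG"
    "CTTTGGTACC" "TGGCTTGGCT" "GGACCTGGAA"
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(ORF_START_EXCERPT))
```

The printed peptide can be compared by eye with the first 20 residues of SEQ ID NO:2.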
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Leu Pro Lys Leu Leu Thr Phe Leu Leu Trp Tyr Leu Ala Trp 
1 5 10 15 

Leu Asp Leu Glu Phe Ile Cys Thr Val Leu Gly Ala Pro Asp Leu Gly 
20 25 30 

Gln Arg Pro Gln Gly Thr Arg Pro Gly Leu Ala Lys Ala Glu Ala Lys 
35 40 45 

Glu Arg Pro Pro Leu Ala Arg Asn Val Phe Arg Pro Gly Gly His Ser 
50 55 60 

Tyr Gly Gly Gly Ala Thr Asn Ala Asn Ala Arg Ala Lys Gly Gly Thr 
65 70 75 80 

Gly Gln Thr Gly Gly Leu Thr Gln Pro Lys Lys Asp Glu Pro Lys Lys 
85 90 95 

Leu Pro Pro Arg Pro Gly Gly Pro Glu Pro Lys Pro Gly His Pro Pro 
100 105 110 

Gln Thr Arg Gln Ala Thr Ala Arg Thr Val Thr Pro Lys Gly Gln Leu 
115 120 125 

Pro Gly Gly Lys Ala Pro Pro Lys Ala Gly Ser Val Pro Ser Ser Phe 
130 135 140 

Leu Leu Lys Lys Ala Arg Glu Pro Gly Pro Pro Arg Glu Pro Lys Glu 
145 150 155 160 

Pro Phe Arg Pro Pro Pro Ile Thr Pro His Glu Tyr Met Leu Ser Leu 
165 170 175 

Tyr Arg Thr Leu Ser Asp Ala Asp Arg Lys Gly Gly Asn Ser Ser Val 
180 185 190 

Lys Leu Glu Ala Gly Leu Ala Asn Thr Ile Thr Ser Phe Ile Asp Lys 
195 200 205 

Gly Gln Asp Asp Arg Gly Pro Val Val Arg Lys Gln Arg Tyr Val Phe 
210 215 220 

Asp Ile Ser Ala Leu Glu Lys Asp Gly Leu Leu Gly Ala Glu Leu Arg 
225 230 235 240 

Ile Leu Arg Lys Lys Pro Ser Asp Thr Ala Lys Pro Ala Ala Pro Gly 
245 250 255 

Gly Gly Arg Ala Ala Gln Leu Lys Leu Ser Ser Cys Pro Ser Gly Arg 
260 265 270 

Gln Pro Ala Ser Leu Leu Asp Val Arg Ser Val Pro Gly Leu Asp Gly 
275 280 285 

Ser Gly Trp Glu Val Phe Asp Ile Trp Lys Leu Phe Arg Asn Phe Lys 
290 295 300 

Asn Ser Ala Gln Leu Cys Leu Glu Leu Glu Ala Trp Glu Arg Gly Arg 
305 310 315 320 

Ala Val Asp Leu Arg Gly Leu Gly Phe Asp Arg Ala Ala Arg Gln Val 
325 330 335 

His Glu Lys Ala Leu Phe Leu Val Phe Gly Arg Thr Lys Lys Arg Asp 
340 345 350 

Leu Phe Phe Asn Glu Ile Lys Ala Arg Ser Gly Gln Asp Asp Lys Thr 
355 360 365 

Val Tyr Glu Tyr Leu Phe Ser Gln Arg Arg Lys Arg Arg Ala Pro Leu 
370 375 380 

Ala Thr Arg Gln Gly Lys Arg Pro Ser Lys Asn Leu Lys Ala Arg Cys 
385 390 395 400 

Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met Gly Trp Asp Asp 
405 410 415 

Trp Ile Ile Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu Gly Leu 
420 425 430 

Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala Val 
435 440 445 

Ile Gln Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr Pro Pro Thr 
450 455 460 

Cys Cys Val Pro Thr Arg Leu Ser Pro Ile Ser Ile Leu Phe Ile Asp 
465 470 475 480 

Ser Ala Asn Asn Val Val Tyr Lys Gln Tyr Glu Asp Met Val Val Glu 
485 490 495 

Ser Cys Gly Cys Arg 
500 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AGAATTCGCA TGCCATGGTC GACG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CTTGAGTACG AGGCTTTCCA CTG 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATTCGCATGC CATGGTCGAC GAAG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGAGCCCACG AATCATGCAG TCA 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ACAGCAGGTG GGTGGTGTGG ACT 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AGAATTCGCA TGCCATGGTC GACGAAGCTT TTTTTTTTTT TTTT 



(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCAGCAGCCC ATCCTTCTCC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TCCAGGGCAC TAATGTCAAA CACG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACTAATGTCA AACACGTACC TCTG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AGCGGCCGCT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met Gly Trp Asp 
1 5 10 15 

Asp Trp Ile Ile Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu Gly 
20 25 30 

Leu Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala 
35 40 45 

Val Ile Gln Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr Pro Pro 
50 55 60 

Thr Cys Cys Val Pro Thr Arg Leu Ser Pro Ile Ser Ile Leu Phe Ile 
65 70 75 80 

Asp Ser Ala Asn Asn Val Val Tyr Lys Gln Tyr Glu Asp Met Val Val 
85 90 95 

Glu Ser Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 60 

(C) OTHER INFORMATION: /note= "Xaa is a gap inserted only for 
comparison purposes" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
1 5 10 15 

Asp Trp Ile Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly 
20 25 30 

Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 
35 40 45 

Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser Xaa Lys Ile Pro Lys 
50 55 60 

Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu 
65 70 75 80 

Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gln Asp Met Val Val 
85 90 95 

Glu Gly Cys Gly Cys Arg 
100 
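
Because SEQ ID NO:14 carries a gap (Xaa at position 60) "inserted only for comparison purposes", the 102-residue sequences can be compared position by position. The following is a minimal sketch of such a comparison for SEQ ID NO:13 and SEQ ID NO:14, not part of the original disclosure. The one-letter strings are transcribed from the two listings, with X standing for Xaa; the choice to exclude the gap position from the denominator is an assumption, and the names `identity` and `cysteine_positions` are illustrative only.

```python
# One-letter transcriptions of the listings above (X = Xaa, the alignment gap).
SEQ13 = (
    "CSRKALHVNFKDMGWD" "DWIIAPLEYEAFHCEG" "LCEFPLRSHLEPTNHA"
    "VIQTLMNSMDPESTPP" "TCCVPTRLSPISILFI" "DSANNVVYKQYEDMVV" "ESCGCR"
)
SEQ14 = (
    "CKRHPLYVDFSDVGWN" "DWIVAPPGYHAFYCHG" "ECPFPLADHLNSTNHA"
    "IVQTLVNSVNSXKIPK" "ACCVPTELSAISMLYL" "DENEKVVLKNYQDMVV" "EGCGCR"
)


def identity(a: str, b: str, gap: str = "X") -> float:
    """Fraction of identical residues over aligned positions, ignoring gaps."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have the same length")
    pairs = [(x, y) for x, y in zip(a, b) if gap not in (x, y)]
    return sum(x == y for x, y in pairs) / len(pairs)


def cysteine_positions(seq: str) -> list[int]:
    """1-based positions of cysteine residues."""
    return [i for i, aa in enumerate(seq, start=1) if aa == "C"]


if __name__ == "__main__":
    print(f"identity: {identity(SEQ13, SEQ14):.1%}")
    print("Cys in SEQ ID NO:13:", cysteine_positions(SEQ13))
    print("Cys in SEQ ID NO:14:", cysteine_positions(SEQ14))
```

Printing the cysteine positions side by side shows whether the cysteines fall at the same aligned positions in both sequences.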

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 60 

(C) OTHER INFORMATION: /note= "Xaa is a gap inserted only for 
comparison purposes" 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 
1 5 10 15 

Asp Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr Cys His Gly 
20 25 30 

Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 
35 40 45 

Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser Xaa Ser Ile Pro Lys 
50 55 60 

Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu 
65 70 75 80 

Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met Val Val 
85 90 95 

Glu Gly Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln 
1 5 10 15 

Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

Ile Val Gln Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ser Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Cys Arg Lys His Glu Leu Tyr Val Ser Phe Gln Asp Leu Gly Trp Gln 
1 5 10 15 

Asp Trp Ile Ile Ala Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly 
20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

Ile Val Gln Thr Leu Val His Leu Met Asn Pro Glu Tyr Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Asn Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln 
1 5 10 15 

Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly 
20 25 30 

Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr Asn His Ala 
35 40 45 

Ile Val Gln Thr Leu Val His Phe Ile Asn Pro Glu Thr Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Gln Leu Asn Ala Ile Ser Val Leu Tyr Phe 
65 70 75 80 

Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val 
85 90 95 

Arg Ala Cys Gly Cys His 
100 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ATGAATTCCC ATGGACCTGG GCTGGMAKGA MTGGAT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ACGTGGGGTG GAATGACTGG AT 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ATATTGGCTG GAGTGAATGG AT 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ATGTGGGCTG GAATGACTGG AT 
(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
ACCTGGGCTG GCAGGACTGG AT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AGGACCTCGG CTGGAAGTGG AT 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGGATCTAGG GTGGAAATGG AT 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AGGATCTGGG CTGGAAGTGG GT 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGCTGGGCTG GGAACGGTGG AT 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ACATCGGCTG GAATGACTGG AT 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
TCATCGGCTG GAACGACTGG AT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
ATGAATTCGA GCTGCGTSGG SRCACAGCA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GAGTTCTGTC GGGACACAGC A 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CATCTTTTCT GGTACACAGC A 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CAGTTCAGTG GGCACACAAC A 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GAGCTGCGTG GGCGCACAGC A 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CAGCGCCTGC GGCACGCAGC A 



(2) INFORMATION FOR SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TAAATCTTGG GACACGCAGC A 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CAGGTCCTGG GGCACGCAGC A 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CCCTGGGAGA GCAGCACAGC A 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CAGCTTGGTG GGCACACAGC A 
(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



CAGCTTGGTG GGAATGCAGC A 